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Abstract — Rapid advances in image acquisition and storage 
technology underline the need for algorithms that are capable 
of solving large scale image processing and computer-vision 
problems. The minimum cut problem plays an important role 
in processing many of these imaging problems such as, image 
and video segmentation, stereo vision, multi-view reconstruction 
and surface fitting. While several min-cut/max-flow algorithms 
can be found in the literature, their performance in practice 
has been studied primarily outside the scope of computer vision. 
We present here the results of a comprehensive computational 
study, in terms of execution times and memory utilization, of 
the four leading published algorithms, which optimally solve 
the s-t cut and maximum flow problems: (i) Goldberg's and 
Tarjan's Push-Relabel; (ii) Hochbaum 's pseudoflow; (iii) Boykov's 
and Kolmogorov's augmenting paths; and (iv) Goldberg's partial 
augment-relabel. Our results demonstrate that while the augment- 
ing paths algorithm is more suited for small problem instances 
or for problems with short paths from s to t, the pseudoflow 
algorithm, is more suited for large general problem instances and 
utilizes less memory than the other algorithms on all problem 
instances investigated. 

Index Terms — Flow algorithms; Maximum-flow; Minimum- 
cut; Segmentation; Stereo- vision; Multi-view reconstruction; Sur- 
face fitting 



I. Introduction 

THE minimum cut problem (min-cut) and its dual, the max- 
imum flow problem (max-flow), are classical combinato- 
rial optimization problems with applications in numerous areas 
of science and engineering (for a collection of applications of 
min-cut and max-flow see d). 

Rapid advances in image acquisition and storage technology 
have increased the need for faster image processing and 
computer-vision algorithms that require lesser memory while 
being capable of handling large scale imaging problems. The 
min-cut problem takes a prominent role in many of these 
imaging problems, such as image and video segmentation, 
EL El, co- segmentation (H, stereo vision El, multi-view 
reconstruction, (51, 0, and surface fitting (8). 

Several min-cut/max-flow algorithms can be found in the 
combinatorial optimization literature. However, their perfor- 
mance in practice has been studied primarily outside the 
scope of computer vision. In this study we compare, in terms 
of execution times and memory utilization, the four leading 
published algorithms, which solve optimally the min-cut and 

B. Fishbain (barak@berkeley.edu) and D.S. Hochbaum 
(hochbaum@ieor.berkeley.edu) are with the Dept. of Industrial Engineering 
and Operational Research at the University of California, Berkeley, Etcheverry 
Hall, University of California Berkeley, CA 94720 

S. Mueller (ste.mu@arcor.de) is with the Combinatorial Optimization & 
Graph Algorithms group at the Technische Universitaet Berlin, Germany 

Manuscript received date; revised date. 



max-flow problems within the scope of vision problems. The 
study consists of a benchmark of an extensive data set which 
includes standard and non-standard vision problems (51, ifTUl . 

The algorithms compared within the scope of this study 
are: (i) the Push-Relabel, PRF, algorithm devised by Goldberg 
and Tarjan ifTTl : (ii) the Hochbaum's pseudoflow algorithm, 
HPF lfT2l : (iii) Boykov's and Kolmogorov's augmenting paths 
algorithm, BK, |[T3l : and (iv) Goldberg's partial augment- 
relabel, PAR, algorithm fl4l . 

The study of these algorithms within the scope of computer- 
vision was reported in fT3l . lfT4l . The first, [13], compares 
the BK algorithm only to PRF, and for a limited set of 
instances. The latter report, lfT4ll . used the same limited set 
of instances, and compared PRF and PAR to HPF. The 
comparison provided in |[T4l to HPF is however not valid, 
as it did not use the updated publicly available software. Here 
we provide, for the first time, a comprehensive review of all 
these algorithms and a detailed comparison of several aspects 
of their performance, including a breakdown of the run- times 
and memory requirements. The breakdown of the run-times for 
the different stages of the algorithm: initialization, minimum 
cut computation and flow recovery, is important as the logic 
of the software is allocated differently by these algorithms 
to these stages. For example, while the initialization process 
in the BK and HPF algorithms only reads the problem file 
and initiate the corresponding graphs, the implementation of 
the PRF incorporates an additional logic into this stage, e.g. 
sorting the arcs of each node. This extends the execution 
time of the initialization phase, and as a result of the entire 
algorithm. While our experiments show that this time is 
significant, it was disregarded in the previous reports in which 
the initialization execution time was not considered as part of 
the algorithm's running times. In addition, for many computer- 
vision applications only the min-cut solution is of importance. 
Thus, there is no need to recover the actual maximum flow in 
order to solve the problem. The breakdown of the execution 
times allows to evaluate the performance of the algorithms for 
these relevant computations by taking into account only the 
initialization and minimum-cut times. 

Our results demonstrate that while the BK algorithm is more 
suited for small problem instances or for problems with short 
paths from s to t, the HPF algorithm, is more suited for large 
general problem instances and utilizes less memory than the 
other algorithms on all problem instances investigated. 

The paper is organized as follows: Section [TT] describes the 
algorithms compared in this study. The experimental setup is 
presented in Section [TTT1 followed by the comparison results, 
which are detailed in Section [IVJ Section [V] concludes the 
paper. 



A. A graph representation of a vision problem 

A vision problem is typically presented on an undirected 
graph G = (V,E), where V is the set of pixels and E are 
the pairs of adjacent pixels for which similarity information 
is available. The 4-neighbors setup is a commonly used 
adjacency rule with each pixel having 4 neighbors - two along 
the vertical axis and two along the horizontal axis. This set-up 
forms a planar grid graph. The 8-neighbors arrangement is also 
used, but then the planarity of the graph is no longer preserved, 
and complexity of various algorithms increases, sometimes 
significantly. Planarity is also not satisfied for 3-dimensional 
images or video. In the most general case of vision problems, 
no grid structure can be assumed and thus the respective graphs 
are not planar. Indeed, the algorithms presented here do not 
assume any specific property of the graph G - they work for 
general graphs. 

The edges in the graph representing the image carry sim- 
ilarity weights. There is a great deal of literature on how to 
generate similarity weights, and we do not discuss this issue 
here. We only use the fact that similarity is inversely increasing 
with the difference in attributes between the pixels. In terms of 
the graph, each edge {i, j} is assigned a similarity weight Wij 
that increases as the two pixels i and j are perceived to be more 
similar. Low values of w^ are interpreted as dissimilarity. 
In some contexts one might want to generate dissimilarity 
weights independently. In that case each edge has two weights, 
w^ for similarity, and w^ for dissimilarity. 

B. Definitions and Notation 

Let G st be a graph (V stj A st ), where V st = V U {s,t} and 
A st = A\jA s yjA t in which A s and A t are the source-adjacent 
and sink- adjacent arcs respectively. The number of nodes \V st \ 
is denoted by n, while the number of arcs \A st \ is denoted by 
m. A flow / = {fij}(i,j)eA st * s sa ^ t0 defeasible if it satisfies 

(i) Flow balance constraints: for each j G V, 

52(i,j)eA at fij = J2 (j , k )eA s Jjk (i.e., inflow(j) = 
outflow(j)), and 

(ii) Capacity constraints: the flow value is between the lower 
bound and upper bound capacity of the arc, i.e., £ij < 
fij < Uij. We assume henceforth w.l.o.g that £ij = 0. 

The maximum flow or max-flow problem on a directed 
capacitated graph with two distinguished nodes — a source and 
a sink — is to find a feasible flow /* that maximizes the amount 
of flow that can be sent from the source to the sink while 
satisfying flow balance constraints and capacity constraints. 

A cut is a partition of nodes S U T = V with s G 
S,t G T. Capacity of a cut is defined by u(S,T) = 
J2ies jeT d j)eA u ij- The minimum s-t cut problem, hence- 
forth referred to as the min-cut problem, defined on the above 
graph, is to find a bi-partition of nodes — one containing the 
source and the other containing the sink — such that the sum 
of capacities of arcs from the source set to the sink set is 
minimized. In 1956, Ford and Fulkerson [15] established the 
max-flow min-cut theorem, which states that the value of a 
max-flow in any network is equal to the value of a min-cut. 

Given a capacity-feasible flow, hence a flow that satisfies 
(ii), an arc (i,j) is said to be a residual arc if (i,j) G A st 



and fij < u^ or (j,i) G A st and fa > 0. For (i, j) G A st , 
the residual capacity of arc (i,j) with respect to the flow / 
is q ■ = u^ — fa , and the residual capacity of the reverse arc 
(j, i) is Cj { = fa. Let A? denote the set of residual arcs with 
respect to flow / in G st which consists of all arcs or reverse 
arcs with positive residual capacity. 

A preflow is a relaxation of a flow that satisfies capacity 
constraints, but inflow into a node is allowed to exceed the 
outflow. The excess of a node v G V is the inflow into that 
node minus the outflow denoted by e(v) = ^Zr u v )eA st fuv — 
^2(v w)eA st fvw- Thus a preflow may have nonnegative excess. 

A pseudoflow is a flow vector that satisfies capacity con- 
straints but may violate flow balance in either direction (inflow 
into a node needs not to be equal outflow). A negative excess 
is called a deficit. 

II. Min-cut / Max-flow Algorithms 
A. The push-relabel Algorithm 

In this section, we provide a sketch of a straightforward im- 
plementation of the algorithm. For a more detailed description, 
see Q, EQ. 

Goldberg's and Tarajen's push-relabel algorithm ifTTIl . PRF, 
works with preflows, where a node with strictly positive excess 
is said to be active. Each node v is assigned a label £(v) that 
satisfies (i) £(t) = 0, and (ii) £(u) < £(v) + l if (u, v) G A f . A 
residual arc (u,v) is said to be admissible if £(u) = £{v) + 1. 

Initially, s's label is assigned to be n, while all other nodes 
are assigned a label of 0. Source-adjacent arcs are saturated 
creating a set of source-adjacent active nodes (all other nodes 
have zero excess). An iteration of the algorithm consists of 
selecting an active node in V, and attempting to push its excess 
to its neighbors along an admissible arc. If no such arc exists, 
the node's label is increased by 1. The algorithm terminates 
with a maximum preflow when there are no active nodes with 
label less than n. The set of nodes of label n then forms 
the source set of a minimum cut and the current preflow is 
maximum in that it sends as much flow into the sink node as 
possible. This ends Phase 1 of the algorithm. In Phase 2, the 
algorithm transforms the maximum preflow into a maximum 
flow by pushing the excess back to s. In practice, Phase 2 is 
much faster than Phase 1. A high-level description of the PRF 
algorithm is shown in Figure [T] 

The generic version of the PRF algorithm runs in 0(n 2 m) 
time. Using the dynamic trees data structure of Sleator and 
Tarjan |[T6lL the complexity is improved to 0(nm log ^-) 
ifTTI . Two heuristics that are employed in practice significantly 
improve the run-time of the algorithm: Gap relabeling and 
Global relabeling (see IfTTIl . IfTTI for details). 

In the highest label and lowest label variants, an active node 
with highest and lowest labels respectively are chosen for 
processing at each iteration. In the FIFO variant, the active 
nodes are maintained as a queue in which nodes are added 
to the queue from the rear and removed from the front for 
processing. In practice the FIFO - highest label variant is 
reported to work best IfTTIl . This variant of the algorithm is 
also referred to as HI_PR. While, in this paper the highest 
label variant was used, it is referred to as PRF to indicate that 
this is the PRF algorithm. 



/* 

Generic push-relabel algorithm for maximum flow. 
*/ 

procedure push-relabel (V s t, A st ,c): 
begin 

Set the label of s to n and that of all other nodes to 0; 
Saturate all arcs in A s \ 

while there exists an active node u £ V of label less than n do 
if there exists an admissible arc (u, v ) do 

Push a flow of mm{e(u) , cl v } along arc {u,v)\ 
else do 

Increase label of u by 1 unit; 
end 



Fig. 1. High-level description of Phase I of the generic push-relabel algorithm. The nodes with label equal to n at termination form the source set of the 
minimum cut. 



B. The Hochbaum 's Pseudo-flow Algorithm 

The Hochbaum 's Pseudoflow algorithm, HPF, [12] was 
motivated by an algorithm of Lerchs and Grossman |[T8l for 
the maximum closure problem. The pseudoflow algorithm 
has a strongly polynomial complexity of 0(nm log ^-) |[T9l . 
Hochbaum 's algorithm was shown to be fast in theory (Y2\ 
and in practice ifTTll for general benchmark problems. 

Each node in v £ V is associated with at most one current 
arc, currArc(v) = (w,v), in A^ \ the corresponding current 
node of v is denoted by currNode(v) = w. The algorithm 
also associates with each node with a root that is defined 
constructively as follows: starting with node v, generate the 
sequence of nodes {v,vi,V2,...,v r } defined by the current 
arcs (vi,v),(v2,vi),..., (v r , v r -i) until v r has no current arc. 
Such root node v r always exists ifTTlL |[T9l . Let the unique root 
of node v be denoted by root(v). Note that if node v has no 
current arc, then root(v) = v. 

The HPF algorithm is initiated with any arbitrary initial 
pseudoflow (i.e, flow vector that may violate flow balance 
in either direction) that saturates source adjacent and sink- 
adjacent arcs. Such initial pseudoflow can be generated, for 
example, by saturating all source-adjacent and sink- adjacent 
arcs, A s U At, and setting all other arcs to have zero flow. 
This creates a set of source-adjacent nodes with excess, and 
a set of sink- adjacent nodes with deficit. All other arcs have 
zero flow, and the set of initial current arcs is empty. Thus, 
each node is a singleton component of the forest for which it 
serves as a tree and the root of the tree. 

The algorithm associates each node v £ V with a distance 
label d(y). A residual arc (w,v) is said to be admissible if 
d(w) = d(v) + 1. 

A node is said to be active if it has strictly positive excess. 
Given an admissible arc (w, v) with nodes w and v in different 
components, an admissible path is the path from root(w) to 
root(v) along the set of current arcs from root(w) to w, the 
arc (w,v), and the set of current arcs (in the reverse direction) 
from v to root(v). 

An iteration of the HPF algorithm consists of choosing an 
active component, with root node label < n and searching 



for an admissible arc from a lowest labeled node w in this 
component. Choosing a lowest labeled node for processing 
ensures that an admissible arc is never between two nodes of 
the same component. 

By construction (see lfl2l ). the root is the lowest labeled 
node in a component and node labels are non-decreasing with 
their distance from the root of the component. Thus, all the 
lowest labeled nodes within a component form a sub-tree 
rooted at the root of the component. Once an active component 
is identified, all the lowest labeled nodes within the component 
are examined for admissible arcs by performing a depth-first- 
search in the sub-tree starting at the root. 

If an admissible arc (w, v) is found, a merger operation is 
performed. The merger operation consists of pushing the entire 
excess of root(iu) towards root(v) along the admissible path 
and updating the excesses and the arcs in the current forest. 
A schematic description of the merger operation is shown in 
Figure [2] The pseudocode is given in Figure [3] 




(a) 



(b) 



(c) 



Fig. 2. (a) Components before merger (b) Before pushing flow along 
admissible path from ri to rj (c) New components generated when arc (w, v) 
leaves the current forest due to insufficient residual capacity. 

If no admissible arc is found, d(w) is increased by 1 unit 
for all lowest label nodes w in the component. The algorithm 
terminates when there are no active nodes with label < n. 
At termination all n labeled nodes form the source set of the 
min-cut. 



The active component to be processed in each iteration can 
be selected arbitrarily. There are two variants of the pseud- 
oflow algorithm: (i) the lowest label pseudoflow algorithm, 
where an active component with the lowest labeled root is 
processed at each iteration; and (ii) the highest label algorithm, 
where an active component with the highest labeled root node 
is processed at each iteration. 

The first stage of HPF terminates with the min-cut and a 
pseudoflow. The second stage converts this pseudoflow to a 
maximum feasible flow. This is done by flow decomposition. 
Hence representing the flow as the sum of flows along a 
set of s-t paths, and flows along a set of directed cycles, 
such that no two paths or cycles are comprised of the same 
set of arcs ( l20lh pages 79-83). This stage can be done in 
0(m log n) by flow decomposition in a related network, |[T2l . 
Our experiments, like the experiments in ifTTl . indicate that 
the time spent in flow recovery is small compared to the time 
to find the min-cut. 



C. Boykov 's and Kolmogorov 's Augmenting Paths Algorithm 

Boykov's and Kolmogorov 's augmenting paths algorithm, 
BK, |[T3l attempts to improve on standard augmenting path 
techniques on graphs in vision. Given that \C\ is the capacity 
of a minimum cut, the theoretical complexity of this algorithm 
is 0(mn 2 \C\). Similarly to Ford-Fulkerson's algorithm |[T5l . 
the BK algorithm's complexity is only pseudo-polynomial. In 
this it differs from the other algorithms studied here, all of 
which have strongly polynomial time complexity. Despite of 
that, it has been demonstrated in [13] that in practice on a set 
of vision problems, the algorithm works well. 

At heart of the augmenting paths approach is the use of 
search trees for detecting augmenting paths from s to t. Two 
such trees, one from the source, Ts, and the other from the 
sink, Tt are constructed, where Ts HTt = 0. The trees are 
constructed so that in Ts all edges from each parent node to 
its children are non-saturated and in T T , edges from children 
to their parents are non-saturated. 

Nodes that are not associated with a tree are called free. 
Nodes that are not free can be tagged as active or passive. 
Active nodes have edges to at least one free node, while 
passive nodes have no edges connecting them to a free node. 
Consequentially trees can grow only by connecting, through a 
non-saturated edge, a free node to an active node of the tree. 
An augmenting path is found when an active node in either of 
the trees detects a neighboring node that belongs to the other 
tree. 

At the initialization stage the search tree, Ts contains only 
the source node, s and the search tree Tt contains only the 
sink node t. All other nodes are free. 

Each iteration of the algorithm consists of the following 
three stages: 

Growth In this stage the search trees Ts and Tt expand. For 
all active nodes in a tree, Ts or Tt, adjacent free nodes, which 
are connected through non-saturated edge, are searched. These 
free nodes become members of the corresponding search tree. 
The growth stage terminates when the search for an active 
node from one tree, finds an adjacent (active) node that belongs 



to the other tree. Thus, an augmenting path from S to T was 
found. 

Augmentation Upon finding the augmenting path, the maxi- 
mum flow possible is being pushed from s to t. This implies 
that at least one edge will be saturated. Thus, for at least one 
node in the trees Ts and Tt the edge connecting it to its 
parent is no longer valid. The augmentation phase may split 
the search trees Ts and Tt into forests. Nodes for which the 
edges connecting them to their parent become saturated are 
called orphans. 

Adoption In this stage the tree structure of Ts and Tt is 
restored. For each orphan, created in the previous stage, the 
algorithm tries to find a new valid parent. The new parent 
should belong to the same set, Ts or Tt, as the orphan node 
and has a non- saturated edge to the orphan node. If no parent 
is found, then the orphan node and all its children become 
free and the tree structure rooted in this orphan is discarded. 
This stage terminates when all orphan nodes are connected to 
a new parent or are free. 

The algorithm terminates when there are no more active 
nodes and the trees are separated by saturated edges. Thus, the 
maximum flow is achieved and the corresponding minimum- 
cut is S = T s and T = T T . 

It is interesting to note that there are two speed-ups for the 
BK- algorithm. The first one is an option to reuse search trees 
from one maxflow computation to the next as described in 
ll2l1 . This option does not apply in our setting as the instances 
are not modified. The other speed-up is due to capacity scaling 
ll22l . We would have liked to test this version but we are not 
aware of any publicly available implementation. 

D. The Partial Augment-Relabel 

The Partial Augment-Relabel algorithm, PAR, devised by 
Goldberg, lTl4l searches for the shortest augmenting path and 
it maintains a flow (rather than a pseudoflow or preflow). A 
relabeling mechanism is utilized by the algorithm to find the 
augmenting paths. 

The algorithm starts at s and searches for admissible, non- 
saturated, arcs in a depth-first search manner. An arc (x,y) 
is admissible if the label of its associated nodes is equal, 
d{x) = d(y). At each iteration, the algorithm maintains a 
path from s to v G V and tries to extend it. If v has an 
admissible arc, (v,w), the path is extended to w. If no such 
admissible arc is found, the algorithm shrinks the path, making 
the predecessor of v on the path the current node and relabels 
v. At each iteration, the search terminates either if w = t, or 
if the length of the path reaches some predefined value, k, or 
if v, the current node has no outgoing admissible arcs. For 
k = Q(y/m) 9 PAR has a complexity of 0(n 2 ^/m) O. 

In order to achieve better performance in practice, the same 
gap and global heuristics mentioned in Section III-A1 for PRF, 
can be applied here for the PAR algorithm. 

III. Experimental Setup 

The PRF, HPF and the BK algorithms are compared here by 
running them on the same problem instances and on the same 
hardware setup. The run-times of the highest level variant of 



/* 

Min-cut stage of HPF algorithm. 
*/ 

procedure HPF (V st ,A sU c)\ 
begin 

Simplelnit (A 8 ,A t , c); 

while 3 an active component T with root r, where d(r) < n, do 
w <— r; 
while W 7^ do 

if 3 admissible arc (w, v) do 

Merger (root(iu), • • • , w, v, • • • , root(v)); 
w <- 0; 
else do 

if By E T : (current (2/) = w) A (c%) = d(w)) do 

w ^ y\ 
else do {relabel} 

d(w) <- d(w) + 1; 
iu ^— current (w); 
end 



Fig. 3. The min-cut stage of the HPF algorithm. At termination all nodes in label-n components are the source set of the min-cut. 



the PRF algorithm and of PAR are reported in |[T4l for a subset 
of the problems used here. Since the source code for the PAR 
implementation is not made available, the PAR performance is 
evaluated here through the speedup factor of PAR with respect 
to the highest level variant of the PRF algorithm for each 
instance reported in the above paper. 

As suggested by Chandran and Hochbaum ifTTl we use 
the highest label version for the HPF algorithm. The latest 
version of the code (version 3.23) is available at (23). The 
highest level variant of the PRF algorithm is considered to 
have the best performance in practice 1241 . We use the highest- 
level PRF implementation Version 3.5, [25]. Note that the 
latest implementation of the Push-Relabel method is actually 
denoted by HI_PR, which indicates that the highest-label 
version is used. We refer to it as PRF, to indicate that it 
is the same algorithm which was reported in l24ll . For the 
BK algorithm, a library implementation was used l26l . In 
order to utilize the library for solving problems in DIMACS 
format, a wrapping code, wrapper, was written. This wrapper 
reads the DIMACS files and calls the library's functions for 
constructing and solving the problem. The part that reads the 
DIMACS files, under the required changes, is similar to the 
code used in the HPF implementation. One should note that the 
compilation's setup and configuration of the library have great 
effect on the actual running times of the code. In our tests the 
shortest running times were achieved using the following com- 
pilation line g++ -w -04 -o <output_f ile_name> 
-DNDEBUG -DBENCHMARK graph . cpp maxflow.cpp 
<wrapper_implementation_f ile>. 

Every problem instance was run 5 times and we report the 
average time of the three runs. These are reported for the three 
different stages of the algorithm (Initialization, Compute Min- 
Cut and Flow recovery). As detailed in section H breaking 



down the run-times provides insight into the algorithms' 
performance and allows for better comparison. Since for many 
computer-vision applications only the min-cut solution is of 
importance (e.g. 0, 0-0, E3-(36|]), the most relevant 
evaluation is of the initialization and min-cut times. 



A. Computing Environments 

Our experiments were conducted on a machine with x86_64 
Dual-Core AMD Opteron(tm) Processor at 2.4 GHz with 1024 
KB level 2 cache and 32 GB RAM. The operating system was 
GNU/Linux kernel release 2.6. 18-53. el. The code of all three 
algorithms, PRF, HPF and BK, was compile with gcc 4.1.2 
for the x86_64-redhat-linux platform with —04 optimization 
flag. 

One should note that the relatively large physical memory 
of the machine allows one to avoid memory swaps between 
the memory and the swap-file (on the disk) throughout the 
execution of the algorithms. Swaps are important to avoid 
since when the machine's physical memory is small with re- 
spect to the problem's size, the memory swap operation might 
take place very often. These swapping times, the wait times 
for the swap to take place, can accumulate to a considerably 
long run-times. Thus, in these cases, the execution times are 
biased due to memory constraints, rather than measuring the 
algorithms' true computational efficiency. Therefore we chose 
large physical memory which allows for more accurate and 
unbiased evaluation of the execution times. 

B. Problem Classes 

The test sets used consist of problem instances that arise as 
min-cut problems in computer vision, graphics, and biomedi- 
cal image analysis. All instances were made available from 



the Computer Vision Research Group at the University of 
Western Ontario (9). The problem sets used are classified into 
four types of vision tasks: Stereo vision, Segmentation, Multi- 
view reconstruction; and Surface fitting. These are detailed in 
sections IIII-B1I through IIII-B3I The number of nodes n and 
the number of arcs m for each of the problems are given in 
Table U 

1) Stereo Vision: Stereo problems, as one of the classical 
vision problems, have been extensively studied. The goal of 
stereo is to compute the correspondence between pixels of 
two or more images of the same scene, we use the Venus, 
Sawtooth (5) and the Tsukuba ll28l data- sets. These sequences 
are made up of piecewise planar objects. Each of the stereo 
problems, used in this study, consists of an image sequence, 
where each image in the sequence is a slightly shifted version 
of its preceding one. A corresponding frame for each sequence 
is given in Figure [4] 

Often the best correspondence between the pixels of the 
input images is determined by solving a min-cut problem 
for each pair of images in the set. Thus in order to solve 
the stereo problem, one has to solve a sequence of min-cut 
sub-problems all of approximately the same size. Previously 
reported run-times of these stereo problem IT3l . |[T4ll disclosed, 
for each problem, only the summation of the run-times of its 
min-cut sub-problems. Presenting the summation of the run- 
times of the sub-problems as the time for solving the entire 
problem assumes linear asymptotic behaviour of the run-times 
with respect to the input size. This assumption has not been 
justified. The run-times here, for the stereo problems, are 
reported as the average time it takes the algorithm to solve 
the min-cut sub-problem. 

Each of the stereo min-cut sub-problems aims at matching 
corresponding pixels in two images. The graphs consist of 
two 4-neighborhood grids, one for each image. Each node, 
on every grid, has arcs connecting it to a set of nodes on the 
other grid. For each of the stereo problems there are two types 
of instances. In one type, indicated by KZ2 suffix, each node 
in one image is connected to at most two nodes in the other 
image. In the second type, indicated by BVZ suffix, each node 
in one image is connected to up to five nodes in the second 
image. 





Sawtooth Venus 

Fig. 4. Stereo test sequences (source (9)) 



Tsukuba 



2) Multi-view reconstruction: A 3D reconstruction is a 
fundamental problem in computer vision with a significant 
number of applications (for recent examples see 0, (7), l37l ). 
Specifically, graph theory based algorithms for this problem 
were reported in l38l - l40l .The input for the multi-view recon- 
struction problem is a set of 2D images of the same scene taken 



from different perspectives. The reconstruction problem is to 
construct a 3D image by mapping pixels from the 2D images 
to voxels complex in the 3D space. The most intuitive example 
for such a complex would be a rectangular grid, in which the 
space is divided into cubes. In the examples used here a finer 
grid, where each voxel is divided into 24 tetrahedral by six 
planes each passing through a pair of opposite cube edges, is 
used (See [38] for details). Two sequences are used in this 
class, Camel and Gargoyle. Each sequence was constructed in 
three different sizes (referred to as small, middle and large) 
EH . Representing frames are presented in Figured] 





Camel 



Gargoyle 



Fig. 5. Multi-view test sequences (source (9)) 

3) Surface fitting: 3D reconstruction of an object's surface 
from sparse points containing noise, outliers, and gaps is also 
one of the most interesting problems in computer vision. 
Under this class we present a single test instance, "Bunny" 
(see Fig. [6]), constructed in three different sizes. The sequence 
is part of the Stanford Computer Graphics Laboratory 3D 
Scanning Repository ifTOl and consists of 362, 272 scanned 
points. The goal then to reconstruct the observed object by 
optimizing a functional that maximizes the number of data 
points on the 3D grid while imposing some shape priors 
either on the volume or the surface, such as spatial occupancy 
or surface area l29l . The "bunny" corresponding graphs, on 
which the min-cut problem is solved, are characterized by 
particularly short paths from s to t [29]. 





Originallmage ReconstructedSurface 

Fig. 6. Bunny Problem Instance - Surface fitting (source flOl ) 

4) Segmentation: Under this group 4 test sets, referred 
to as "Liver", "adhead", "Babyface" and "bone" are used. 
Each set consists of similar instances which differ in the 
graph size, neighborhood size, length of the path between 
s and t, regional arc consistency (noise), and arc capacity 
magnitude 11421 . For all instances used in this group, the 
suffices n and c represent the neighborhood type and max- 
imum arc capacities respectively. For example, bone.n6.cl0 



and babyface.n26.cl00, correspond to a 6 neighborhood and a 
maximum arc capacity of 10 units and a 26 neighborhood with 
maximum arc capacity of 100 units respectively. The different 
bone instances differ in the number of nodes. The grid on 
the 3 axes x,y and z was made coarser by a factor of 2 on 
each, thus bone_xy, means that the original problem (bone) 
was decimated along the x,y axes and it is 1/4 of its original 
size; bone_xyz, means that the original problem was decimated 
along the x,y and z axes and it is 1/8 of its original size. 



IV. Results 



A. Run-times 



In this study, the comparison of the PRF's, HPF's and BK's 
run-times are indicated for the three stages of the algorithms: 
(i) initialization, ti n n\ (ii) minimum-cut, tminCuu and (iii) 
maximum-flow, t max Fiow As these data is unknown for the 
PAR algorithm, the comparison of these three algorithms with 
respect to PAR is addressed differently, by running PRF on 
our setup and deducing the PAR run-times by multiplying the 
measured PRF time by the speedup factor reported in lfT4l . 
This is explained in Section IIV-B1 

As already indicated, the most relevant times in this study 
are the times it takes each of the algorithms to complete the 
computation of the min-cut, thus t init + tminCut- These are 
graphically presented in Figure [7] and detailed in Table U The 
Slowdown Factor, reported in table U for each algorithm, for 
every problem instance, is the ratio of the time it takes the 
algorithm to complete the computation of the minimum-cut 
divided by the minimum time it took any of the algorithms to 
complete this computation. 

Figure I 31a)| presents the run-times for the stereo vision 
problem sets. The input's size, for these problems is small, 
with respect the the other problem sets. For these small 
problem instances, the BK algorithm is doing better than PRF 
(with average Slowdown factor of 2.86, which corresponds 
to average difference in the running time of 2.0 Seconds) 
and slightly better than HPF (slowdown factor of 1.24, which 
corresponds to a running time difference of 0.24 Seconds). 
For the Multi-view instances HPF presents better results than 
both algorithms with average slowdown factors of 1.46 with 
respect to BK and 3.19 with respect to PRF. These correspond 
to differences in the running times of 95 and 170 seconds 
respectively. This is illustrated in Figure I 31b)| Figure [(c)] 
shows that the BK algorithm is more suitable for solving 
the surface fitting instances. This is attributed to the fact that 
these problems are characterized by particularly short s-t paths. 
In these instances, the slowdown factors of HPR and PRF 
are 1.05 (correspond to an average difference of 9 seconds) 
and 4.06 (difference of 454 seconds). The running times 
for the Segmentation problems class are depicted in Figure 
I 31d)| There are 36 segmentation problems. In a subset of 5 
segmentation problems BK achieved shorter running times. In 
this subset the BK's average slowdown factors are 1.19 (9.24 
seconds difference) and 2.62 (106 seconds difference in the 
running time) with respect to HPF and PRF respectively. On 
the rest of the 31 segmentation problems, HPF shows shorter 
running times with slowdown factors of 1.18 (14.22 seconds 



difference) with respect to BK and 2.62 (101.39 seconds 
difference) with respect to PRF. 

A total of 51 problem instances were tested within the scope 
of this study. The HPF algorithm was shown to be better 
in 37 problem instances. The BK algorithm achieved better 
results over the other 14 problems. The average run-times and 
slowdown factors of these two subsets are given in Table HH 

TABLE II 
Average running times and slowdown factors 



PRF | HPF | BK 


HPF is better (37 problem instances) 


Ave. run-time 
Ave. slowdown 


184.87 
2.6 


72.25 
1 


99.69 
1.39 


BK is better (14 problem instances) 


Ave. run-time 
Ave. slowdown 


185.96 
3.03 


53.53 
1.18 


48.00 
1 
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(c) Surface Fitting 
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Fig. 7. Initialization and Minimum-cut run-times in seconds: |(a)| Stereo 
Problems; [(b)] Multi-view Problems; [(c)] Surface Fitting; [(d)] Segmentation 

In order to allow for the comparison of the times it takes 
each of the algorithms to complete the computation of only 
the min-cut phase, the initialization run-times are presented in 



Instance 


Run-times [Sees] 


Slowdown Factor 


Name 


Nodes 


Arcs 


PRF 


HPF 


BK 


PRF HPF BK 


Stereo 


sawtoothBVZ 


173,602 


838,635 


1.55 


0.76 


0.62 


2.50 1.23 1 


sawtoothKZ2 


310,459 


2,059,153 


4.05 


1.87 


1.52 


2.66 1.23 1 


tsukubaBVZ 


117,967 


547,699 


1.09 


0.50 


0.41 


2.66 1.22 1 


tsukubaKZ2 


213,144 


1,430,508 


3.42 


1.33 


1.06 


3.23 1.25 1 


venusBVZ 


174,139 


833,168 


1.93 


0.82 


0.66 


2.92 1.24 1 


venusKZ2 


315,972 


2,122,772 


5.12 


2.09 


1.66 


3.08 1.26 1 


Multi-View 


camel-lrg 


18,900,002 


93,749,846 


558.65 


143.96 


225.53 


3.88 ] 


L 1.57 


camel-med 


9,676,802 


47,933,324 


240.30 


67.61 


83.43 


3.55 ] 


L 1.23 


camel-sml 


1,209,602 


5,963,582 


17.16 


6.31 


6.46 


2.72 ] 


L 1.02 


gargoyle-lrg 


17,203,202 


86,175,090 


413.26 


134.77 


432.31 


3.07 ] 


L 3.21 


gargoyle-med 


8,847,362 


44,398,548 


196.26 


56.24 


226.43 


3.49 ] 


L 4.03 


gargoyle- sml 


1,105,922 


5,604,568 


12.26 


5.00 


13.86 


2.45 ] 


L 2.77 


Surface Fitting 


bunny-lrg 


49,544,354 


300,838,741 


1564.10 


305.05 


277.02 


5.56 1.10 1 


bunny-med 


6,311,088 


38,739,041 


129.05 


34.47 


32.82 


3.93 1.05 1 


bunny- sml 


805,802 


5,040,834 


10.89 


4.12 


4.03 


2.70 1.02 1 


Segmentation 


adhead.n26cl0 


12,582,914 


327,484,556 


970.74 


344.31 


407.42 


2.82 ] 


L 1.18 


adhead.n26cl00 


12,582,914 


327,484,556 


971.20 


362.49 


476.04 


2.68 ] 


L 1.31 


adhead.n6cl0 


12,582,914 


75,826,316 


219.14 


90.17 


90.38 


2.43 ] 


L 1.01 


adhead.n6cl00 


12,582,914 


75,826,316 


242.43 


103.03 


123.22 


2.35 ] 


L 1.20 


babyface.n26cl0 


5,062,502 


131,636,370 


333.91 


245.10 


250.15 


1.36 ] 


L 1.02 


babyface.n26cl00 


5,062,502 


131,636,370 


378.94 


272.26 


321.20 


1.39 ] 


L 1.18 


babyface.n6cl0 


5,062,502 


30,386,370 


103.27 


48.30 


35.30 


2.93 1.3' 


1 1 


babyface.n6cl00 


5,062,502 


30,386,370 


126.28 


58.77 


43.89 


2.88 1.3' 


I 1 


bone.n26cl0 


7,798,786 


202,895,861 


451.35 


160.73 


196.26 


2.81 ] 


L 1.22 


bone.n26cl00 


7,798,786 


202,895,861 


472.34 


168.09 


198.73 


2.81 ] 


L 1.18 


bone.n6cl0 


7,798,786 


46,920,181 


119.76 


41.02 


47.79 


2.92 ] 


L 1.17 


bone.n6cl00 


7,798,786 


46,920,181 


132.06 


43.56 


51.88 


3.03 ] 


L 1.19 


bone_subx.n26c 1 


3,899,394 


101,476,818 


209.03 


79.70 


100.23 


2.62 ] 


L 1.26 


bone_subx.n26c 1 00 


3,899,394 


101,476,818 


218.57 


81.79 


107.43 


2.67 ] 


L 1.31 


bone_subx.n6cl0 


3,899,394 


23,488,978 


64.17 


20.89 


26.57 


3.07 ] 


L 1.27 


bone_subx.n6cl00 


3,899,394 


23,488,978 


61.18 


22.06 


30.29 


2.77 ] 


L 1.37 


bone_subxy.n26c 1 


1,949,698 


50,753,434 


99.59 


39.51 


48.25 


2.52 ] 


L 1.22 


bone_subxy.n26c 1 00 


1,949,698 


50,753,434 


101.13 


40.00 


51.04 


2.53 ] 


L 1.28 


bone_subxy.n6c 1 


1,949,698 


11,759,514 


27.22 


10.04 


12.09 


2.71 ] 


L 1.20 


bone_subxy.n6c 1 00 


1,949,698 


11,759,514 


27.66 


10.56 


13.62 


2.62 ] 


L 1.29 


bone_subxyz.n26c 1 


983,042 


25,590,293 


48.07 


18.89 


23.69 


2.54 ] 


L 1.25 


bone_subxyz.n26c 1 00 


983,042 


25,590,293 


48.83 


19.39 


25.41 


2.52 ] 


L 1.31 


bone_subxyz.n6c 1 


983,042 


5,929,493 


11.95 


4.85 


5.95 


2.46 ] 


L 1.23 


bone_subxyz.n6c 1 00 


983,042 


5,929,493 


12.05 


5.14 


6.54 


2.34 ] 


L 1.27 


bone_subxyz_subx.n26c 1 


491,522 


12,802,789 


22.96 


9.36 


11.27 


2.45 ] 


L 1.20 


bone_subxyz_subx.n26c 1 00 


491,522 


12,802,789 


23.55 


9.52 


11.55 


2.47 ] 


L 1.21 


bone_subxyz_subx.n6c 1 


491,522 


2,972,389 


5.47 


2.37 


2.75 


2.31 ] 


L 1.16 


bone_subxyz_subx.n6c 1 00 


491,522 


2,972,389 


5.56 


2.47 


2.89 


2.25 ] 


L 1.17 


bone_subxyz_subxy.n26c 1 


245,762 


6,405,104 


10.99 


4.63 


5.60 


2.37 ] 


L 1.21 


bone_subxyz_subxy.n26c 1 00 


245,762 


6,405,104 


11.06 


4.74 


5.79 


2.33 ] 


L 1.22 


bone_subxyz_subxy.n6c 1 


245,762 


1,489,904 


2.55 


1.14 


1.34 


2.24 ] 


L 1.18 


bone_subxyz_subxy.n6c 1 00 


245,762 


1,489,904 


2.59 


1.21 


1.39 


2.14 ] 


L 1.15 


Iiver.n26cl0 


4,161,602 


108,370,821 


272.63 


123.61 


112.40 


2.43 l.K 


) 1 


Iiver.n26cl00 


4,161,602 


108,370,821 


297.88 


132.48 


128.60 


2.32 i.o: 


5 1 


liver. n6c 10 


4,161,602 


25,138,821 


82.11 


35.24 


32.02 


2.56 l.K 


) 1 


Iiver.n6cl00 


4,161,602 


25,138,821 


96.38 


40.36 


43.60 


2.39 ] 


L 1.08 



TABLE I 

Vision problems: Graph sizes with combined Initialization and Minimum-cut run-times and their corresponding speedup factors. 

Each problem's fastest run-time is set in boldface. The speedup factor states how much an algorithm runs compared to the 

fastest algorithm 



Figure [8] and detailed in Appendix El Tables HTH - ED Ideally 
one should be able to evaluate the minimum-cut processing 
times by subtracting the initialization times in Tables [TTT1 - [VJ 



from the corresponding times in Table HI However, as described 
in Section [U while the BK and HPF algorithms only read the 
problem's data and allocate memory, the PRF algorithm has 



10 



some additional logic in its initialization phase. Consequen- 
tially, one can not evaluate PRF's min-cut processing times by 
this substraction. To accomplish that, one has to account for 
the time it takes the PRF algorithm to compute the additional 
logic implemented with the initialization stage. Figure [8] shows 
that for all problem instances, the PRF's initialization times 
(Unit) are 2 — 3 times longer than BK's and HPF's times. 
While these times were excluded from the total execution 
times reported in fTT3Tl and |[T4lh Figure [8] strongly suggests 
that these initialization times are significant with respect to to 
the min-cut computation times (tminCut) and should not be 
disregarded. 
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(d) Segmentation 
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(b) Multi-View 
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(c) Surface Fitting 
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Fig. 8. Initialization run-times in seconds: [(a)] Stereo Problems; [(b)] Multi- 
view Problems; |(c)| Surface Fitting; [(d)] Segmentation 

The actual maximum-flow plays a less significant role 
in solving computer vision problems. Yet, for the sake of 
completeness, the maximum flow computation times of the 
algorithms (U nit + tminCut + t max Flow) are reported in Figure 
0and in Tables [VIIl-[Xl 

B. Comparison to Partial Augment-Relabel 

The PAR run-times, on our hardware setup, are deduced 
from the speedup factor for PAR with respect to the highest 
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(b) Multi-View 
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(c) Surface Fitting 






bunny-sml 






BK 


bunny-med 






■ HPF 


bunny-lrg ™ 






PRF 


500 1000 1500 



Iiver.n6cl00 

Iiver.n6cl0 

Iiver.n26cl00 

Iiver.n26cl0 

bone.n6cl00 

bone.n6cl0 

bone.n26cl00 

bone.n26cl0 

bone.xyz.n6cl00 

bone.xyz.n6cl0 

bone.xyz.n26cl00 

bone.xyz.n26cl0 

bone.xyz.xy.n6cl0 

bone.xyz.xy.n6cl0 

bone.xyz.xy.n26cl 

bone.xyz.xy.n26cl 

bone.xyz.x.n6cl00 

bone.xyz.x.n6cl0 

bone.xyz.x.n26cl0 

bone.xyz.x.n26cl0 

bone.xy.n6cl00 

bone.xy.n6cl0 

bone.xy.n26cl00 

bone.xy.n26cl0 

bone.x.n6cl00 

bone.x.n6cl0 

bone.x.n26cl00 

bone.x.n26cl0 

babyface.n6cl00 

babyface.n6cl0 

babyface.n26cl00 

babyface.n26cl0 

adhead.n6cl00 

adhead.n6cl0 

adhead.n26cl00 

adhead.n26cl0 



t 


■ 


*-r | 


— 


L. 


L. 


— ' 


■p" 






L 


L 






, 






BK 


HPF 


PRF 


, 


, 


L 


L 


L 


L 


- 


- 


hi 


u 


WmWm 


-m- 


K 


E 







500 



1000 



Fig. 9. Initialization, Minimum cut and Maximum Flow run-times in 
seconds: [(a)! Stereo Problems; |(b)|Multi-view Problems; [(c)] Surface Fitting; 
|(d)| Segmentation 



level variant of the PRF which are reported in [14]. The paper 
above reports only the summation of the min-cut and max- 
flow run-times (without initialization), t min Cut + t maxF i ow . 
Therefore, to enable a fair comparison we use the min-cut and 
max-flow run-times of the other algorithms as well. For tp AR 
and tp RF , the run-times reported in [ 14] for the PAR and PRF 
algorithms respectively, the estimated run-time of PAR, ipAR, 
on our hardware is: 

v minCut * maxFlow ) 



tpAR 



t 



PAR 



V 



PRF 



where t m f^ Cut and t m ^ Flow are the corresponding run-times 
of the PRF algorithm measured on the hardware used in this 
study. 

The comparison results are given in Figure [K)| for all prob- 
lem instances reported in lfT4l . As reported in fT4l . the PAR 
algorithm indeed improves on PRF. HPF outperforms PAR for 
all problem instances. It is noted that in this comparison of the 
run-times that exclude initialization PAR's performance is still 
inferior to that of HPF. If one were to add the initialization 



time then the relative performance of PAR as compare to HPF 
would be much worse since the initialization used has time 
consuming logic in it as note previously in Section U and 
is shown in Figure [8] In terms of comparing PAR to BK, 
Figure [K)| shows that PAR is inferior to BK for small problem 
instances, but performs better for larger instances. 
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(c) Segmentation 
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(b) Multi-View 
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Fig. 10. PAR, PRF, HPF, BK Minimum cut run-times in seconds: [(a)] Stereo 
Problems; |(b)|Multi-view Problems; |(c)| Segmentation 



C. Memory Utilization 

Measuring the actual memory utilization is of growing 
importance, as advances in acquisition systems and sensors 
allow higher image resolution, thus larger problem sizes. 

The memory utilization of each of the algorithms is a 
result of two factors: (i) the data each algorithm maintains 
in order to solve the problem. For example, the BK algorithm 
must maintain a flow /, the list of active nodes and a list 
of all orphans (see Section III-Q : and (ii) the efficiency 
of the specific implementation's memory allocation in each 
implementation. The first factor can be analytically assessed by 
carefully examining the algorithm. The latter, however, must 
be evaluated empirically. It is important to note that both do 
not necessarily grow linearily with the problem size. The mem- 
ory usage was read directly out of the /prod [process]/ statm 
file for each implementation and for each problem instance. 
One should note that the granularity of the information in this 
file is the page-file size, thus 4096 Bytes. 

Figure [TT] summarizes the results of the memory utilization 
for BK (solid blue line), HPF (dashed green line) and PRF 



(dotted red line) algorithms. These are detailed in Appendix 
iBl Table |XI] The X-axis in Figure [TT] is the input size. A 
Problem's input size is the number of nodes, n plus the 
number of arcs, m, in the problem's corresponding graph: 
input size = n + m. The number of nodes, n, and the number 
of arcs, m, for each of the problems are given in Table U The 
Y-axis is the memory utilization in Mega- Bytes. 

Both BK and PRF algorithms use on average 10% more 
memory than the HPF algorithm. For problem instances with 
large number of arcs, the PRF and BK require 25% more 
memory. This becomes critical when the problem size is con- 
siderably large, with respect to the machine's physical mem- 
ory. In these cases the execution of the algorithms requires 
a significant amount of swapping memory pages between the 
physical memory and the disk, resulting in longer execution 
times. 
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Fig. 11. Memory Utilization Vs. Input size 



D. Summary 

Figure [12] is a graphical summary of the run-times of each of 
the algorithms for the min-cut task {tinit^minCut) depending 
on the problem size. Figure [T2l and tables U and [TT] suggest that 
the BK and the HPF algorithms generate comparable results. 
The the first, BK, is more suited for small problem instances 
(less then 1,000,000 graph elements (#Nodes + #Arcs)) or 
for instances that are characterized by a short paths from s 
to t. The latter, HPF, might be used for all other general 
larger problems. Figure [10] shows that this also holds for PRF's 
revised version, the partial augment-relabel (PAR) algorithms, 
for all vision problem instances examined in this study. In 
detail, out of the 51 instances BK is dominating 14 times 
with an average running time of 48 seconds on these instances. 
HPF on the other hand has an averaged running time of 54 
seconds and therefor the HPF algorithm has a slowdown factor 
of 1.18 with respect to BK. On the remaining 37 instances HPF 
is dominating with an average running time of 72 seconds. It 
takes BK 100 seconds in average to finish on theses instances, 
whitch results in a slowdown factor of 1.39 for BK with 
respect to HPF. 
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V. Conclusions 

This paper presents the results of a comprehensive compu- 
tational study on vision problems, in terms of execution times 
and memory utilization, of four algorithms, which optimally 
solve the s-t minimum cut and maximum flow problems: (i) 
Goldberg's and Tarjan's Push-Relabel; (ii) Hochbaum' s pseud- 
oflow; (iii) Boykov's and Kolmogorov's augmenting paths; and 
(iv) Goldberg's partial augment-relabel. 

The results show that the BK algorithm is more suited for 
small problem instances (less then 1, 000, 000 graph elements, 
thus vertices and arcs) or for instances that are characterized 
by short paths from s to t, the HPF algorithm is better suited 
for all other general larger problems. In terms of memory 
utilization, the HPF algorithm has better memory utilization 
with up to 25% saved in memory allocation as compared to 
BK and PRF. 

Our results are of significance because it has been widely 
accepted that both BK and PRF algorithms were the fastest 
algorithms in practice for the min-cut problem. This was 
shown not to hold in general ifTTl . and here for computer 
vision in particular. This, with the availability of HPF algo- 
rithm's source-code (see (23 j), makes HPF the perfect tool for 
the growing number of computer vision applications which 
incorporate the min-cut problem as a sub-routine. 

The current strategy of speeding up computers is to in- 
crement the number of processors instead of increasing the 
computing power of a single one. This development suggests 
that a parallelization of the algorithm would be beneficial. 
We expect the HPF algorithm to behave well with respect 
to parallel implementations as well. 
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Appendix A 
Run-times 



Stereo 


Instance 


PRF 


HPF 


BK 


sawtoothBVZ 


1.02 


0.57 


0.53 


sawtoothKZ2 


2.69 


1.40 


1.32 


tsukubaBVZ 


0.65 


0.37 


0.34 


tsukubaKZ2 


1.79 


0.96 


0.90 


venusBVZ 


1.02 


0.57 


0.53 


venusKZ2 


2.79 


1.45 


1.36 


Average 


1.66 


0.89 


0.83 



TABLE III 
Initialization stage run-times : Stereo Vision problems 



Multi-View 


Instance 


PRF 


HPF 


BK 


camel-lrg 


155.25 


76.97 


70.22 


camel-med 


76.40 


38.44 


35.15 


camel- sml 


8.66 


4.65 


4.23 


gargoyle-lrg 


141.81 


68.79 


63.50 


gargoyle-med 


70.79 


34.97 


32.33 


gargoyle- sml 


8.11 


4.33 


3.94 


Average 


76.84 


38.02 


34.89 



TABLE IV 
Initialization stage run-times : Multi-View problems 



Surface Fitting 


Instance 


PRF 


HPF 


BK 


bunny-lrg 

bunny-med 

bunny-sml 


687.01 

70.87 

7.99 


230.81 

29.25 

3.70 


219.08 

27.95 
3.47 


Average 


255.29 


87.92 


83.50 



TABLE V 
Initialization stage run-times : Surface Fitting problems 



Segmentation 


Instance 


PRF 


HPF 


BK 


adhead.n26cl0 


697.57 


238.71 


233.34 


adhead.n26cl00 


702.45 


242.09 


239.02 


adhead.n6cl0 


144.05 


58.00 


55.12 


adhead.n6cl00 


146.12 


59.82 


56.47 


babyface.n26cl0 


180.35 


94.24 


92.50 


babyface.n26c 1 00 


182.76 


95.88 


94.62 


babyface.n6cl0 


44.31 


22.77 


21.71 


babyface.n6cl00 


44.85 


23.37 


22.35 


bone.n26cl0 


381.60 


146.00 


144.64 


bone.n26cl00 


382.59 


149.93 


145.70 


bone.n6cl0 


85.74 


35.58 


33.66 


bone.n6cl00 


86.26 


36.68 


34.72 


bone subx.n26cl0 


182.47 


72.52 


71.59 


bone_subx.n26c 1 00 


183.88 


73.70 


72.54 


bone_subx.n6c 1 


41.02 


17.62 


16.78 


bone_subx . n6c 1 00 


41.46 


18.18 


17.28 


bone_subxy.n26c 1 


87.33 


36.05 


35.32 


bone_subxy.n26c 1 00 


88.08 


36.37 


35.89 


bone_subxy.n6c 1 


19.62 


8.73 


8.26 


bone_subxy . n6c 1 00 


19.77 


8.99 


8.60 


bone_subxyz.n26cl0 


41.91 


17.60 


17.31 


bone_subxyz.n26cl00 


42.26 


17.93 


17.74 


bone_subxyz.n6c 10 


9.26 


4.28 


4.10 


bone_subxyz.n6c 100 


9.29 


4.46 


4.21 


bone_subxyz_subx.n26c 1 


20.24 


8.80 


8.66 


bone_subxyz_subx.n26cl00 


20.47 


8.93 


8.76 


bone_subxyz_subx.n6c 1 


4.32 


2.16 


2.05 


bone_subxyz_subx.n6c 1 00 


4.39 


2.23 


2.10 


bone_subxy z_subxy.n26c 1 


9.63 


4.38 


4.30 


bone_subxy z_subxy.n26c 1 00 


9.68 


4.47 


4.34 


bone_subxy z_subxy.n6c 1 


2.05 


1.06 


1.02 


bone_subxyz_subxy.n6c 1 00 


2.08 


1.11 


1.05 


liver. n26c 10 


200.28 


76.43 


75.65 


Iiver.n26cl00 


200.64 


77.25 


76.02 


liver. n6c 10 


44.58 


18.68 


17.90 


Iiver.n6cl00 


44.76 


18.91 


17.99 


Average 


122.45 


48.44 


47.31 



TABLE VI 
Initialization STAGE run-times -.Segmentation problems 



Stereo 


Instance 


PRF 


HPF 


BK 


sawtoothBVZ 


1.64 


0.88 


0.62 


sawtoothKZ2 


4.26 


2.02 


1.52 


tsukubaBVZ 


1.15 


0.57 


0.41 


tsukubaKZ2 


3.55 


1.42 


1.06 


venusBVZ 


2.02 


0.94 


0.66 


venusKZ2 


5.32 


2.22 


1.66 


Average 


2.99 


1.34 


0.99 



TABLE VII 
Total run-times of the Initialization and Min-cut and Max-flow 

STAGES - Stereo Vision PROBLEMS 



Multi-View 


Instance 


PRF 


HPF 


BK 


camel-lrg 


563.52 


180.57 


225.53 


camel-med 


242.74 


80.56 


83.43 


camel- sml 


17.44 


6.92 


6.46 


gargoyle-lrg 


417.75 


154.40 


432.31 


gargoyle-med 


198.56 


66.66 


226.43 


gargoyle- sml 


12.54 


5.61 


13.86 


Average 


242.09 


82.45 


164.67 



TABLE VIII 
Total run-times of the Initialization and Min-cut and Max-flow 

STAGES - Multi-View PROBLEMS 



Surface Fitting 


Instance 


PRF HPF BK 


bunny-lrg 

bunny-med 

bunny-sml 


1595.21 440.59 277.02 

131.78 43.50 32.82 

11.18 4.74 4.03 


Average 


579.39 162.94 104.62 



Segmentation 


Instance 


PRF 


HPF 


BK 


adhead.n26cl0 


982.98 


356.07 


407.42 


adhead.n26cl00 


985.39 


383.55 


476.04 


adhead.n6cl0 


223.30 


94.54 


90.38 


adhead.n6cl00 


248.06 


110.58 


123.22 


babyface.n26cl0 


341.32 


264.41 


250.15 


babyface.n26cl00 


392.62 


325.34 


321.20 


babyface.n6cl0 
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52.74 


35.30 
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129.39 


72.35 


43.89 
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456.03 


163.58 


196.26 
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174.07 


198.73 
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121.31 


42.74 
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bone_subx.n26c 1 
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100.23 


bone_subx.n26c 1 00 
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83.42 


107.43 
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21.41 
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22.52 


30.29 
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100.65 
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bone_subxy.n6c 1 
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10.30 


12.09 


bone_subxy . n6c 1 00 
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23.69 


bone_subxyz.n26cl00 


49.39 
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5.28 


6.54 
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9.48 
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9.68 


11.55 


bone_subxyz_subx.n6c 1 
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2.43 
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bone_subxyz_subx.n6c 1 00 
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2.54 


2.89 


bone_subxy z_subxy.n26c 1 


11.12 


4.67 


5.60 


bone_subxyz_subxy.n26c 1 00 
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5.79 


bone_subxy z_subxy.n6c 1 


2.60 


1.16 


1.34 


bone_subxyz_subxy.n6c 1 00 


2.64 


1.24 


1.39 


liver. n26c 10 


275.68 


126.03 


112.40 


Iiver.n26cl00 


301.11 


135.20 


128.60 


liver. n6c 10 


83.42 


36.99 


32.02 


Iiver.n6cl00 


97.88 


42.30 


43.60 


Average 


177.25 


78.42 


84.79 



TABLE IX 
Total run-times of the Initialization and Min-cut and Max-flow 

STAGES - Surface Fitting PROBLEMS 



TABLE X 
Total run-times of the Initialization and Min-cut and Max-flow 

STAGES - Segmentation PROBLEMS 
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PRF 


HPF 


BK 


Stereo 


sawtoothBVZ 


62.28 


58.50 


69.27 


sawtoothKZ2 


141.08 


125.81 


147.55 


tsukubaBVZ 


41.62 


39.55 


48.79 


tsukubaKZ2 


97.72 


87.12 


104.57 


venusBVZ 


62.27 


58.65 


69.24 


venusKZ2 


145.76 


129.76 


152.22 


Segmentation 


adhead.n26cl0 


20,759.80 


16,480.20 


20,719.40 


adhead.n26cl00 


20,759.80 


16,480.20 


20,719.40 


adhead.n6cl0 


5,399.80 


4,960.20 


5,359.40 


adhead.n6cl00 


5,399.80 


4,960.20 


5,359.40 


babyface.n26cl0 


8,347.10 


6,628.00 


8,335.40 


babyface.n26cl00 


8,347.10 


6,628.00 


8,335.40 


babyface.n6cl0 


2,167.30 


1,993.20 


2,155.60 


babyface.n6cl00 


2,167.30 


1,993.20 


2,155.60 


bone.n26cl0 


12,863.50 


10,212.70 


12,841.30 


bone.n26cl00 
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12,841.30 


bone.n6cl0 


3,343.50 


3,072.70 


3,321.30 


bone.n6cl00 


3,343.50 


3,072.70 


3,321.30 


bone_subx.n26c 1 


6,435.30 


5,109.30 


6,428.10 


bone_subx.n26c 1 00 


6,435.30 


5,109.30 


6,428.10 


bone_subx.n6cl0 


1,675.30 


1,539.30 


1,668.10 


bone_subx.n6cl00 
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1,539.30 


1,668.10 


bone_subxy.n26c 1 


3,220.40 


2,557.04 


3,220.60 


bone_subxy.n26c 1 00 


3,220.40 


2,557.06 


3,220.60 


bone_subxy.n6c 1 


840.40 


772.00 


840.60 


bone_subxy.n6c 1 00 


840.40 


772.00 


840.60 


bone_subxyz.n26c 1 


1,625.60 


1,291.00 


1,629.40 


bone_subxyz.n26c 1 00 


1,625.60 


1,291.00 


1,629.40 


bone_subxyz.n6cl0 


425.60 


391.10 


429.40 


bone_subxyz.n6c 1 00 


425.60 


391.10 


429.40 


bone_subxyz_subx.n26c 1 


815.10 


647.60 


820.80 


bone_subxyz_subx.n26c 1 00 


815.10 


647.60 


820.80 


bone_subxyz_subx.n6cl0 


215.10 


197.70 


220.80 


bone_subxyz_subx.n6c 1 00 


215.10 


197.70 


220.80 


bone_subxyz_subxy.n26c 1 


409.60 


325.80 


416.30 


bone_subxyz_subxy.n26c 1 00 


409.60 


325.80 


416.30 


bone_subxyz_subxy.n6c 1 


109.60 


100.80 


116.30 


bone_subxyz_subxy.n6c 1 00 


109.60 


100.80 


116.30 


Iiver.n26cl0 


6,872.10 


5,455.20 


6,863.80 


Iiver.n26cl00 


6,872.10 


5,455.20 


6,863.80 


liver. n6c 10 


1,792.00 


1,645.20 


1,783.80 


Iiver.n6cl00 


1,792.00 


1,645.20 


1,783.80 


Multi-View 


camel-lrg 


6,879.30 


6,392.60 


6,814.80 


camel-med 


3,519.90 


3,272.50 


3,490.60 


camel- sml 


441.50 


411.30 


444.50 


gargoyle-lrg 


6,313.40 


5,851.80 


6,255.40 


gargoyle-med 


3,253.60 


3,015.00 


3,227.40 


gargoyle- sml 


413.30 


382.52 


416.60 


Surface Fitting 


bunny-lrg 


21,389.40 


19,600.30 


21,208.00 


bunny-med 


2,753.30 


2,515.00 


2,736.80 


bunny- sml 


360.50 


327.70 


365.10 



TABLE XI 
Memory Utilization in [MBytes] 



